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Oil Content and Composition Enhancement by Targeted/Non-Targeted 
Expression of Plant Plastidial and Bacterial Gene For Glycei^C^3- 
Phosphate Acyltransferase 

BACKGROUND ORTHE INVENTION 

There is a general Industrial demand for higher oil content in living organisms 
in order to be more cost effective during production and procestjing. It is 
estimated that a 1% increase in seed oil would add S20 M to the value of the 
Canadian canola crop at the farm gate. 



One way of accomplishing It is by modifying metabolic pathways of hving 
Qi organisms by enhancing the expression of genes involved in oil synthesis. 

For example, overexpression of a yeast SLC1'1 g&m has been found to 

OS 

pji Increase oil content in plants (Zou et a!., 1997)- 

Lnj Only a limited number of genes have been reported to enhance oil content 

^_ (for example. Zou et a!., 1 997, Mariilia et ah, 1 998). There is a need to 

mi explore other approaches using other gfenes which might be more effective 

alone or In combination with previously cloned genes. 

□ 

S $UM?^RY OF THE INVENTION 



We propose to increase seed oil content and modify seed oil composition by 
overexpressing the genes for the enzyme glycerol^-phosphate 
acyltransferase (GPAT). This we achieved earlier in yeast (See US 
Provisional Patent Application No. 60y 139.788, June 21 , 1999). We now 
report the extension of our invention to plant oils. 

This is the firet demonstration of an increase in carbon flux into plant 
triglycerides by manipulating the expression of a OPAT enzyme. 
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GPAT is the first enzyme of membrane and storage lipid (triayc!gfyceroI;TAG) biosynthesis in 
Irving organisms. II determines the type of fatty acid Incorporated in the sn-1 position of 
membrane and storage lipids. Overall, its spedficity for various acyl-CoA appears to be 
flexible (Frantzen, 1993; Frentzen and WoUer, 199B). Since it is the first ^tep in triglyceride 
and phospholipid lipid biosynthesis, rt is sumrtised that enhancing production of GPAT might 
also result in an increase of l-acylgYycerol-3-phosphate through the Kennedy pathway, 
resulting cn higher oil content. Three types of GPAT have been reported iVom plants: plastidial 
(P). mitochondrial (M), and cytosottc (£R). They exhibit variable spedficity toward ac/t-CoA 
and acyl-ACP substrates (Frentzen, 1993), The ER form of GPAT is considered to be the 
most important for TAG manipulation but it has not been possible to isolate a gene for this 
enzyme. However^ cONAs for P-GPATs has been isolated and characterized from a number 
of species. Also, heterologous GPAT genes from bacteria and mammals are avai^ble (see 



g; Wilkison and Bell. 1997; Dircks and Sul, 1997). 



A plastidial GAPT gene has been used by others to change the fatty acid composition of 
membrane lipids and to improve chilling tolerance (Murata, 1992 and Nishizawa, 1996). The 
bacterial GPAT gene (pIsB) has also been used to change the fatty acid composition of 
men^brane lipids and to decrease chilling tolerance (Wolter, 1 992). None of the studies thus 
far have used GPAT genes to increase oil content in a living organism. 



Because of the unavailability of an £R GPAT gene from plants, we set out to use a plant 
^1! plastidial GPAT and a bacterial GPAT gene to establish Uie piool of our concept. Here we 

^ demonstrate the use of a plant plastidial or bacterial GPAT gene to increase oil content in plant 

^ seeds. Two genes were modified to create four versions to achieve higher oil content. 



A cONA (-gene) for a P-GPAT has been isolated in this lab from safflower (Carf^amus 
trnctorius) (Bhella and MacKenzie, 1994). A P-GPAT can use both acyl-ACP and acyl-CoA 
substnates^ although the latter with lesser efficiency (Frentzen. 1993), It was surmised that in 
the absence of any acyl-ACP in the cytosol. a P-GPAT will have to use acyl-CoA, although it 
has to compete with the ER-GPAT for substrates, acyl-CoAs and giyceml-3-phosphate (G-3- 
P), Its competitive ability could be enhanced by overexpressing it by using a strong 
constitutive or a tissue specific promoter. Normally, a P-GPAT is targeted to plastids by a 
transit peptide (TP). The TP was deleted to confne the GPAT activity in the cytosol (P-GPAT- 
TP). The modifad gene was labelled as ctpgpai-lp. It is anticipated that an ER environment 
may also be required for a P-GPAT-TP to effectively participate in TAG biosynthesis. The ER 
retention signals (ERRS; Jackson et aL. 1990) were added to to P-GPAT-TP to meet this 
requirement. The modified gene was labelled as ctpgpat-tp-^Btrs, 

The Eschenchia cofi'spfsB gene has been previously characterized (seis Wilkison and Be(l. 
1997). The pfeS can use both acyl-ACP and acyl-CoA equally effectively. The p/sS gene was 
used as such and also with an added ERRS sequence. The gene constructs were labelled as 
ptsB and pls8-^errs. 
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EXPERIMENTAL 

Preparation of chimenc genes and expression vectors 

An open reading frame {orf; -1.1 Kb) wicnout tp was amplified by PCR from ihe ctpgpBt cONA. 
Another chimeric gene containing an errs at its 3' end was al$o PCR amplifrad. The orf of the 
E.coli pisB gene (-2-5 Kb) was PCR amplined in unmodified form or wrth an errs at its V end. 
The bJunUend PCR fragments generated using Pfu DNA polymerase were c)oned into pSK 11 
cloning vector. The genes were labeled as ctpgpaNp, otpgpaNp+errs, pIsB and pIsB-^errs, 
The intact ctpgpat cONA was also used (Bhella and MacKenzie, 1994). Their sequences are 
shown in, respectively, Figures l to 5. U should be noted that the sequances representing the 
ctpgpat are different from that published earlier (Bhella and MacKenzie, 1994). There were 
errors in the published sequence. 

For plant expression study, chimeric genes were recovered from the cloning vector and cloned 
into the plant transformation vector, pHS737 (tandem 35s CaMV promoter) for constitutive 
expression of GPATs, The recombinants were transformed into Agrobacterium tumefacions 
G GV 3101 for Arabidopsis transformation. 

SJi Transformation of Arabicfopsis thaliana 

nj Arabidopsis thaliana plants were transformed by the floral dip method. Seeds (Tt) from these 

pfants were collected and selected on a growth medium containing kanamydn. Transgenic 
O plants were grown to maturity and seeds (Ti) from 10 individual plants were collected and used 

yii for lipid analysis. V>^id type and plants transformed with vector alone were grown as controls 

s ' along with the transformed plants. 

Upid analysis 

L^X The fatty acid composition of the seeds was determined by GC anatysis roliowing extraction of 

g^, the oit and conversion of the triglycerides to fatty acid methyl esters. A known amount of CIS 

iJ: triglyceride was added to the seed sample as a marker before oil extraction. Total seed lipid 

^ content was estimated on the basis of the recovery of Cl5 fatty acid metliyJ ester. 



RESULTS AND DISCUSSION: 

Most of the transgen^ plants appeared nonmal in morphology. Tliose that were not were 
discarded. .TTie oil compositions and seed sizes of selected samples representing each 
contruct are shown in Table 1 and Rguns 5. Seeds of plants Iransfomied using only the 
PHS737 vector were indistinguishable in oil content from wild type control ptants grown under 
the same conditions. All other gene constructs produced higher seed oil content. The 
unmodified spgpat, which is expressed in the plastid, produced oil increases ranging from 10 
to 21%. suggesting that /ysaphosphatidic acid is released from the plasUds. On average the 
greatest increase in oil was observed in seeds of transformants carrying the spgpat-tp gene 
(average +22%), Addition of an ER targeting sequence had no apparent effect on seed oil 
content The pisB gene increased seed oil content by an average of 15%, The addition of an 
ER targeting sequence resulted in an average seed on increase of 18% but the difference 
cannot be regarded as significant- It is somewhat unexpected thai Ihe spgpai protein, which 
ordinarily acts on an acyf-ACP substrate and has a much reduced actrvHy with acyl-CoA 
substrates, effected a greater seed oil increase (on average) than the plsB protein. The pisB 
protein reacts equally well with acyl-ACP and acyl-CoA substrates. 
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Seeds of plants transformed with the vector only did not differ signfficanUy in average weight 
from wild type plants (Table 1). Seeds of individual plants fmm each conlrucl were 
significantiy larger than wnd type and the pHS737 control; e.g. 315-2, 301-2. 302-6, 303-3 and 
304-15. However, increased seed oil content was not always posrtively conelated with 
increased seed weight; e.g. 303-7 and 304-1. Samples representing spgpe\t genes onty were 
more highly correlated (coefficient 0.53) than those representing pIsB genes (coefficient 0.18). 

The phenotypes representing increased seed oil content and size would result in increased 
yields in oilseed crops. Those representing an increase In seed oil content without an increase 
in seed siie would provide mora oi) per tonne of seed and would present advantages to 
oilseed processors. Appropriate phenotypes can. therefore, be chosen for specific 
applications. 

We have demonslrated thai expression of both a bacterial and a plant GPAT increases plant 
seed oil content and seed ske. 

O 

Some compositional changes were also observed in the seed oils of the Iransformaftts. 
SIj Selected examples are Illustrated in Table 2. Seed oils from plants transformed with the vector 

nil alone {pHS737) were not significantly different from the unlransformed wild type A. ihaftana, 

^ Ho significant changes were obsenred in the proportions of any of the saturated fatty acids or 

O of 18:2 and 20:1, However, the proportions of 18:1 Z9 (deic acid) decreased and of 18:3 Z9, 

Ull 212, Z15 (a^linolenic acid) Increased in some of the indhndual transformanls of all constructs. 

5 Examples are 315-6, 315-8, 301-2. 302-5, 303-3 and 304-4. These compositional changes do 

Q not always coindda with an increase in total seed oi) content: for example, 315-6. 

nj; Furthennore, increases in seed oil content are not always accompanied by a change in 

composition: for example 315-4 and 303-7, 

p: We hypothosire that overexpresslon of GPAT increases the proportion of lyso-phosphafidic 

ri! add which contains oleic acid at the sn-l position and, therefore. In turn, the proportion of 

phosphalidic acid having the same sn-1 fatly add composition. The latter constitutes the 
sutistrate for subsequent desaturation by the A-12 fatty acid desaturase to produce tinoleic 
acid which is subsequently further desaturated by the A-15 fatty acid desaturase. The result is 
increased proportions of a-linolenic acid, the temiinal product of the des.ituration pathway. 

It is surprising that the change in the proportions of oleic and a-linolenlc adds is observed for 
both the safflower and E, co// GPATs. )n in vitro assays, the safflower plastidial GPAT prefers 
unsaturated fatty acids as compared to the E. coZ/GPAT which prefers s aturated fatty adds. 
Nevertheless, both genes, in modified and unmodified forms, present an approach to 
decreasing the proportion of oleic acid and increasing the proportion of iinolenic add in 
contexts where such a result is desirable. In the latter context, the process is independent of 
any intelleaual property rights vested in any of the fatty acid desaturascis. 

Since GPAT is a ubiquitous enzyme for phospholipid and triglyceride biosynthesis and the 
biochemical pathway is identical in all living organisms, it is reasonable to expect that the 
overexpression of these genes will also enhance oil synthesis in other microorganisms and 
other plant species. It is also reasonable to expect that expression of other GPATs will have 
similar effects on seed oil content. 



Summary 

An increase in triglyceride synthesis and average seed size following expression in Arabidipsis 
Ih^fiana of an E. co/t GPAT and a modified safflower GPAT has been demonstrated. 
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Table 1. Oil content of wffd type ancf transformant ArQbicfopsIs thaiian^ seeds 



Sir 



Sample 


Transformant 


OHC 
(Wt%) 


ronlent 

% fncrease 


Wt/iOO seeds 
<nig) 












Wild type 


Average (n«7) 


26.9 ± 0.8 




1.42 ±0.29 












pHS737 


Average (n=7) 


26.8 ±1,09 




1.41 ±0J26 












cipgpat 


315-2 


32.4 


20.9 


2.47 




315-3 


29.4 


9.7 


1.92 




315-4 


30.1 


12.3 


1.90 




315-7 


29.6 


10-4 


1,85 












cipgpaf - ip 


301-2 


34.0 


26.9 


2.03 




301^ 


34.6 


29.1 


1.90 




301-5 


29.3 


9.3 


1.79 












ctpgpat-tp^errs 


302-2 


29.4 


9.7 


1.44 




302-6 


32,7 


22.0 


2.45 












pisB 


303-2 


29.8 


11.2 


1.28 




303-3 


30.8 


14.9 1.98 




303-4 


29.1 


8.6 1 1.89 




303-7 


33.2 


23.9 


1.63 












plsB errs 


304-1 


30.9 


15.2 


1.51 




304-2 


32,5 


21.3 


1.60 




304-3 


30.3 


13.1 


1.38 




304-15 


32.5 


21.3 


2.20 
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Table 2, Fatty Acid Composition of Wild Type and Transformed Arabidopsis thaliana 



m 
m 



f2i3 

o 



Samp{a/6onstruct 


Wt Vt Oil 


18:1 


18:3 










Wild typa 


26.9 ± 0.8 (n=7) 


14.4 ±0.7 


15.7 ±0.8 










pHS737 


26.8 ±1.1 (n=6) 


12.4 + 1.3 


1f5.9±0.4* 


















315-2 


32.4 


13.6 


18.4 




30 1 


14.7 


16.5 






9.1 


18.4 


oio-o 


^# .1 


Q 9 








10 A 


17 S 










spspac-ip 








3U1-I 


ZT.o 


■fO A 


1 » .1 


OU 


OA.U 


iO Q 


1fl A 




o*t,o 


15 2 


18.2 










sp Qpa t*tp+efTs 












liC.O 


1 f% 








i7 *; 

1 r .O 


302-6 


32.7 


12.9 


17.5 


302-7 


27.8 


15.8 


16.5 










pisb 








303-1 


28.4 


12.0 


17.8 


303^ 


30,8 


11.4 


17.6 


303-4 


29,1 


14.7 


17.1 


303-7 


33.2 


14.8 


15.9 










plsb + errs 








304-2 


32,5 


12.4 


17.0 


304-3 


30.3 


12.1 


16.8 


304^ 


23.5 


10.7 


16.0 


304-9 


27.4 


13.9 


17.4 


304-15 


32.5 


14.1 


16.9 
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Flgura 1* Nucleotide sequence of the gene ccpgpatlCajrthAitixjs cinctorius 
plascidiaX) used for trans forctiiri? yeast and AJCAbidops^s c/saii^/id. Th« scart 
and scop codoci« are u&derlined. Nucleotides chat are different from the 
published secruence are highlighcsd- 

TCTCTCTCTCrCACACACAACACACAAAAXIACAWCTACTGCTACTTTCTCTCrCT^ 

TCCATCTTCTTCTCTCCTTCCTCCCCTACTCTCTTCTTCTCC^CCACi^CGCAAATCCTAaGGTTTCTCCTTC^ 

TCACCTTCTTCTGCCTTCACTCCTCCTCTGTCTTCTTCTCGCCTCCGCCCGATTCTCCGGGSGrrTCCGTGCCTCGCC 

XTCrCTGCGCCGCCGAATGCCGCCCATGGCACCGCGCAGACCCTCCACGGCAACAAGTGCCCGTCACCGTCGTCCTCC 

rcCTCTGCTGCTACGCAACCGTCCGCTGGATCCGACCACGGTCACTCTCCXACATTCATCGATGCTCGTTCCGW^C^ 

GATCTTCTTTCTGGAATTCJWVGAGAGTTGCAAGCrGGAACACTGCCAAAACArATTGCTCAAGCAATGGAGGAG^ 

TATCAGAACTACAAAAATGCAGTTCTCCAAAGTGCGGCTCCTCATGCAGAAGAXArTGTGTTGT»^ 

GCGXTTGWCGIATGTTCTTGGAXGTGAAGGACCCGTTTCSAATTTTCACCATAICATGAAGCTAT^ 

AACTACTAXAXGXTTGGXCAAAATXAXAXTCGGCCTTIGGTCAATTTCAGGGAAICATACCBTTGGCAATGTCXCCGTX 

TTCGGTGlAAtGGAAGAGCAGCXTAAGCAGCGXGACAACGXGGXTTIGATCrCAAACCATsyAAC^ 

GCTGTTATTGCCTXGAXGCXTGAAACAACAAACCCCCATAfTTCTGAGAACAXAATCXAC'TrGGCAGGGGAX 

ATAACAGATCCTCTTTGCAAGCCTXTCAGCATGGGAAG6AATCIGrXGTGCGTGIATTCA^I\AAAGC^ 

GTTCCTGAGCXTGCTGAGAXGAAAAAAAGATCAAATACAAGAAGTXXAAAAC^JIPI^ 

ICTAAAATAATATGGATXGCGCCAAGXGGXGGCAGCGACAGCCCACArcCTATCACAAATCACTCGTTTCCGCCAC^ 

XTTGATGCCACTTCGCTT6ACAACATGAGAAGGCTCGXGGACCATGCIGGXTTGGTGGGT»aUCAT^ 

AXAXTGTGCa\XGACATCATGCCCCCTCCTCTTCAGGTTGAGAAAGA?VATXGGAGAGAAGAGggiSAT 

GGCACCGGAATATCAGTGGCACCGGAAAXXAAXTTCXrAAGAAGXTACTGCCXCrTGTGGGTCCCCCG^ 

GCACCTTAITCACAGGCACTCXAXGATTCCGXGTGTGJ«VCAATACAAGGIGCTACAXrCTGCGCTA<»TGGAGGAAAA 

GGGXTASAAGCATCAACACCAAGYGXCXCGTXGTCJVCAACCCTTGCAGXXTCXCGATTACTCTCTTCGXTTAGA 

GTGWiAGCATATTCirrrGTTXAGATGACAXAGGrGTATAGATGArACCGAACAAXAGAI'GTACAAA 

AAGATGTATGTCrAAXCAAAAAATGXTTTCTG<yvXCrTGXAAACGGATCTTCAAAACAa^CCXXTTAIXTTAGCXGCA 

CCAACCAAIAXATCAAAACAGCTrrrrCTTXTT 



Xhe traiLs laced protein seq^uence of the gaae etpgpac used for trans to ming 
yea-ac and ArAhXdapsXs chailana. Amiixo acids that are differenc from the 
published sequence are highlighted. 



sr.SLXHNTONTJnrCYFLS LLHS PIAMS IFFSPS S PTLFFSTTKAOTRVSPSS S PS S ATl-TPiS SSIOllF ri^^ 

F5APAhOSAHGXAETVHGNKMPSPSSS5SAAT0PSAGSDHGKSRirinARSBQDLZ»SGI(iR^ 

YQ^^yKNAVI42aAAPKA£DIVI,SN^®VAFDR^lFlJ^^^ 

FGV>lSfiQtJCQGDNV\^ZSNHQTSADPAVlALMLETTNPHI SEKI IYVAGDJlVrTDPLCKPFS>aGRNt.LCVTf SKKHMND 
VPEIASMKKRSNTRSLI®<AlXLRGGSKIIwrAPSGGRDRPDPIX>»QOTPAPFnATSL;^^ 
ILCH0IMPPPLQVEKEICTiqg|lSPHGTGl5VAPErNFQEVTASCCSPEEAKWvYSQAt.YDSVCEQV^^ 
GI.EA3TP3V5I,SQPMPLD 




Fiaxxre 2. Nuclaottde seijuence of Che qeM ccpgp4C-tp (Cartha/nus Cinccoxius 
plascidial GPAT from which the se<TUence Cor & putative transac peptide was 
removedl used for cransf orroing yeast and AraMdopsia t/>dlidnd. The start and 
stop codocis are underXinad. ATG was introduced *s a start codon in place of 
GAC, Nucleotides that are different tram the published se^juence are 
highlighced- 

CGATCCATGCACGGTCACrCTCGTACATTCATCGATCCTCCrrCCGAACAAGATCTTCTTTCTGGAATTCAA^ 
CAAACTGCGGCTCCTCATCa«3AA(y^ATTGTGTTGTa\^ 

AAGGAGCCGTTTGAATTTTCACCATATCATGAAGCTATTTTGGAACCTTTTAACXACTATATGTTTGGTCAA^ 

ATTCGGCCTTTGGTCAATTTCAGGGAAirCATACGTTGGia^TCTCTCCGXTTTCGGTGTA^.TGGAACAG^ 

cyvGGGTGAOtAGGTGGTTTXGATCtCAAACCATcaW^CAGAAGCAGArCCJ^ 

ACAAACCCCCyVTMTTCTGAGAACATAATCTACGTGGCAGGGGAXAGAGTAATAACaiGATCCTCTW^ 
A6CATGCGA?U:K3JATCTGTTGTGCtn'GTATTe3WyJWV6a 

AGATCAAATACAAGAAGXrrAAAAG^gTGGCirrTGCTXrrGAGGGGCGGATCTAAAATAnTATGGATTGCGCCAA^^ 

CGTGGCMCGACAGGCCAGATCCTATCACAAATCACTGGTTTCCGGCACCGTTTGATGCa^C^ 

AGAAGCCTCGXGGACCATGCTGGTWGGTGGGTCACATATATCCTTTAGCCATArXGTGCCATGACA^ 

CCTCTTCauSGTTGAGAAAGAAATTCGAGAGAAGAf^^gGATCTCTTTTCATGGCACCGGAATATCAGT 

ATTAATTTCCAAGAAGTTACTGCCTCTTGTGGCTCCCCCGAGGAGGCGAAGGCAGCXTAITCACAGGCACTCTATGAT 

TCCGTSTGXGAACJU^TACAAGGXGCXACAXXCTGCGGTACAXGGACW^AAAGGGXTAGAP.G 

T CGXTGTCACAACCCTTGCACXrXCTCGATXAGGAXCC 



Xhe tran5iaCed procein setjuence o£ the jene ctpsEpat-cp u^ed for transforming 
yeast and Ar^idopsxs thaljCana. T he mature polypeptide will not have the 
first 90 amino acids of the etpjpat IBheila and MacKenzie, 19941 . And.no acid 
91, Asp {D)/ is replaced with Met (M) - Amino acids that ara different from 
^he published sequence are highlighted. 

GSMaGKSRXFlDARSEQDIXSGIQRELEAGXLPKKIAOrtffiELYQNYKWAVI^ 

KEPFErSpyHEAXLBPFWYYMPGQNYJRPLVWFRESWGWSVTGVMEEQLKQGDlOA/^ 

TNPHlSEWlIYVAGDRVITDPLCKPFSMGRNIJifCWSKKHMOT^ 

GGRDRPDP rTWCWTPAPFDAXSLONMRiaVDHAGLVGHllf PIAILCHDlMPPPMVra ISVAFE 
njPQEVTASCGSPBEWCAArSQALTOSVCBQyKVLHSAVHGGKGLEASTPSVSLSQP WFtD . O 
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Figure 3. Nucleocide sequence of the gene ccpgpat-tp+errs t CircJiaiiiui 
ta/iccoWuj plaacidial GPAT from wKich the se<Tuenc« for ft putative transit 
peptide removed and sequences for endoplasmic xecicuXiun xecention sl9aal5 

were added). Stare and scop co<io2vs are underlined. ATG was rntroducad as a 
scare codon in place of GAC. Additional ly, AAC and AXA (also underlined! in 
place of tTG and TTX were introduced for ER retention of Che protein. 
Wucleotides that are different frnm the published sequence ars higniighted. 

GGATCCATGCACGGTC^CTCTCGTACATTCATCGATSCTCGTTCCGTACAAGATCTTCTrTCTGGAATTCA^ 
TTGaOKGCTGCPAOiCTGCZAfJ^CMhTtCC^ 

OUlASTGCGGCTCCTCyvTGCAGAAGATArrGrGTTGTCAAACXTGCCrGTAGCGTTTGATCUSTATGTTC^ 

AACGAGCCGTTTGAATTTTCACCATAl'CATGAAGCTATTTTGGAACCTTTTAACTACTAT/vTGTTTGGTCAA^ 

ATTCGGCCTTTGGTCAATTTCJwSGGAATCATACGTTGGCAATGXCTCCCTrrTCCGTCTX^T 

CAGGGTGACAAG6TGGTTrrGATCTCAAACCATCAAA(yvGAAGCAi3ATCCACCTG 

ACJU^ACCCCCATATTTCTGAGAACATAATCTACGXGGCAGGGGATAGACTAArAACAGATCCTCT 

AGCATGGGAAGGAATCTGTTGTGCGXGTATTCAAAAAAGCATAXGAATGATGTTCCXGAGrTTGCTGAGATGAAAAAA 

gii agatcaaatacaagaagtttaaaag3^^ggctttgcttttgaggcgcggatctaaaataa.^ 

^ ck?tggcagggacaggccagatcctatcacaaatcactggtttc cggcaccctttgatgccjicttcgcttqacaacatg 

, . agaaggctcgtggacimgctggattggtgggtcacmatatcctttagccatartgtgccatcacatcatgccc^ 

^ CCTCTTCAGGTTGAGTVAAGAAATXGGAGAGAAGTiGg^SATCTCXTTTCATGGCACCGG^ATAXCAGTGGCACCGGAA. 

ATTAXTTTCCAAGAAGTTACTGCCTCTTGTGGGTCCCCCGAGGAGGCGAACGCAGCXTATTCACAGGCACTCXATGA.T 
XCCGTGTGTGAACAATACAAGGTGCTACATTCTGCGGTACATGGAGGAAAA6G6TTAGAAGCATCAACACCAAGTGTC 

^J: TCGTTGTCACJ^CCCAAGCJUSAftfiCTCGA T-fflg GAXCC 

Q 

Uj! Translated protein sequence of the gene ccpgpat— Qa+err*. Xae mature 

s polypeptide will not have ch^ first 90 aniino acids of the ctgpat (Bhella and 

Q MacKensie^ 1994). Amirvo acid 91. Asp (D) , is replaced with Met (M) . Also hys 

l^jj {X}. underlined, replaced a Fhe iT) And L«u (L} at position 3 and S Crota the C 

terniinu«. A&lno acids that are different from the published sequence are 

M* highlighted. 

0^ 

ri' GSMKGHSRTFinARSEQDLLSGlQRBtEAGTLPKRIAQAMEEtYOWTOIAVLQSAAPH?^ 

if KB PFEFS p■n{BAII.BPF^XYWrGQIfY IRFLWn«:5YVGKVSVFG\WEEQLKQGDKVV^^ I SNHQXfiADPAVIALMLET 

^ TNPKISEa^IIYVMORVITDPLCKPFSHGHNLLCWSKKHMKIAfPBIAaiK^^ 

GGRDRPI)PrTNQWFPA?Fr>AXSXDrWI«U»Vl)ttAGI.VGRiyPrMLCHDIOTPPIiQVEKEIGEI^lS 
lOT0ZVTASCG5PEfiAKAAYSQALYDSVCEQYKVLHSAVHGGKCLEASXPSV3I#SQPK5KLD . D 



IT 



rimire 4 iJucleotide sequence of tha gene plsM (E-coii GPAT) Stare and scop 
cod^s underlined. A nucleotide chat is different ftom T:he published 

secjuetiee is highlighted. It was introducad durincf PCR, 

AGATCTTCCCATCACTTTCTGCTATCCTTGCCGCGCATTTGCATTATTAACCAGAGGCTTTX^ 
CTGGCa^CCAATTTACTACAAATTACTGAATTrAC^ 

TGCCCCGGAACTGGGCCTGGATACCTCXCGTCCAATTATGTACGTTTTACCGTACAACTCC.AAAGCAGAm 
CTTGCGCGCCCAGTGXCTGGCACAr<»^CTTGCCTGACCCGTT^^ 

CtATGTGTTCATTCACGGCGGGCCGCGT6TGTTCACCTATTACACGCCGAAAGAAGAGTC7:ATXAAGCTG^ 

CTATCTCGATTTGCACCCTAGCAACCCAAATCTGGATGTGCaUSAlGGrGCC^^ 

TOG6CGTGft?JVAAGGCGAAGTGWVCCCGCCGCTCCGTATCCOTAACGGCCTAC^^ 

Sgtcgcgacagttttgtgcgtttctcgccgtcagtttcgctgcsccgtatggcggatgaaca^^ 

TWcicTWGAAACTGGCGCGCGTGGCGCGTAXGCACTT^^^ 
T^TCGTCMGXTCTGTrrAATAAGCTCCXCGCCTCCCCCGCCATTGCCAA^ 

aaSatctcccatgaaaaagcgcagcagaacgcgattgcactgatggaaga^^ 

GATTCGCCTGACXGACCGTXTTCTGCGCTTCACCTGGAACCGACXTTACCAGGGCATCAftCGTCC^^ 

CGTTCGCCAGCTGCCC»CGACGGTCArGAGCTCGTATATGTGCCTTGCCACCGCAGTC^.CATGGACT^^ 

TTCTXACGTSCTGTATCACCAGGGGCTGGrGCCGCCGavTATCGCCGCCGGGAT^ 

GCCGATTTTCCGCCGTCTGGGGGCCTTCTTTAXTCCCCGTACGTTTAA^GGCAATAAACTTTATrcCACCGTT^ 

GGAGTATCTCCGC6AACTCTTCAGCCGTGGTTATrCCGXCGAGTACTTCGTGCAAGGC6<;TCGXTCCCGTACGGGGCG 

TXXGCXGGAXCCGAAAACn^GGTACGCTGXCGATGACCAXTCAGGCGATGCTCCGrGGCGGCACGCGXCCG^^ 

MXCCGATCTAXATCGGTTATGAGCACGTOVrGGAAGXGGGXACTTACGCCAAAGAACrGC^ 

GAAAGAGAGCCIGCCGCAGAXGCTGCGCGGXTXAAGCAAGCTGCGXAAXCTCGGTCAG^^^^ 

ACCAATGCCGTTGATGACCTACCXXAACC:AGCAXGTACCXGACTGGCGXGAAXCXAXaS^TCCCAXC6AAGC<^XC 
TCCGGCAXGGTXAACGCCGACGGXCAATAAIATTGCXGCCGAXCTGATGGTACGCATTAACAACGCAGCCGCOGC^ 
CGCCATGAACCXGTGCXGTACTGCGCXACTGGCATCACGXCAGCGCICACXCACCCGCGAGCAGTTAACCGAGCAACI 
CAACXGCXACCXGGATCTGATGCGCAACGTGCCCXACTCCACGGACTCTACCGXXCCTTCAGCCAGCGCCAGCGAGCT 

TATCGATCACCCGCXGCAAAXGAACAAGXTXGAAGTCGAGAAACAOVCAAXCGGCGAO'.TCATCA 
GCAAGCGGIGCXGATGACCTACXAXCGCAACAACAXXGCGCAXAXGTTGGXGCTGCCXTCGCTGAXGGCt^CAAICCT 

CACCCAGCArcGCCACATCTCCCGCGACGTATXCATGCa^GWCCTCAA^ 

GXTceXGCGCTGGGATCGCGACGAGXTGCCGGACGXTATTGAXGCGCXGGCAAATGAGATGCAACGTCAGGGGCXGAX 

XACCCXGCAAGATGAXGAGXTGCATATCAACCCGGCGOVrrCrrCGCACGCTACAGCTCCIGGCCGCAGGC^ 

AACGCTGCAACGTXAXGCCATCACCTXCTGGXTGXIGAGTCCCAACCCGXCGATCAACOGCGGTACGCTGGAGAAACA 

GAGCCGCACCGXCGCGCAACGXCTCXCCCXGCTSCACGGCATCAACGCGCCGGAGTTCTTCGACAAGCaSGXGXXCAG 

TTCTCXGGlGCXGACACXGCGTGAIGAAGGGTATATCAGCGATACCGGCGAlGCCGAACCCGCAGAAACGATGAAGGr 

XXAXCAGXXGCXGGC6GAGTXGAXXA»XCAGACGXGCGXITGACGATTGAGACXGCGACGCAGGGCGAAGCGTA^ 

AGAXCX 

Translated protein secpience of the gene plsB (£- coii GPAT) . An amino acid 
that is different from che pub li shad aetjuence is highliqrhr-ed, 

|^FCyPaUlFALX.XRG?TSEMSGMPRIYyKIXja,PLSIlVKSK5lPADPAPEW^ 

QClJ^£)LPDPLBPLEIDGTLLPRrs^IHGGPRVFX«TPKEESIKLFHDYlJ?I^I^NT«LDVQ^^ 

KGEVNPPLRMITOVQKFFAVLWLGRDSr^^SPSVSIJFlRMADSKGTDCT 

DI.nWIXASRArAlCAVEDEARSKKI5HEKACQNAlAIMEEIAAI*PSrE»llRLXDRIMr^^ 
Ijan>GK2rVYVPCKRSHMDYI.IiSWLVHQGLVpPHIAAGJNIOTWPAGPIF^^ 

CELPSRGysVEYFVBGGRSRTGRLLDPKXGTLSMrlQMeRGGXRPIXLIPlYIGYrHVMEVGTYAKELJ^^ 
LPQMLRGI^KLBKliGQGVVMFGEPMPIJylTni/QHVPDWKSSI 

IrCCXALIASRQRStTREQIiTEQIlJCYtDIJJCRNVPYSXDarVPSASASBLIDHALQMJ-a^ 
IJ^trrmiNIAHMLVLPSIWJ^IVTQKRHISRDVIWEHV^ 

D0EI,HrKPAHSRXLQLIJ^GABEILQRYAIXFWI.MANPSINRCTl^KBSRTVAQR3.SVX 
LTtRDBCYXSDSGnAEPAETMKVyQLlAEI.ITSDVRZ.XIESATQGEG . SV 
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Fi^re 5. Nucleocide sequence of Che gene pJ.sB-t'erXS lE.coli OPAT Co which 
sequences iox endoplastnic roticuluxa retention signals were added) . Star^ and 
stop codons axe underlined. AAC and AA6 (alao underlined) In place of GCC and 
ACG were introduced at the 3* end for ER retention of protein. 

AGATCTTCCC ATG ACTTrCTGCTATCCTTCCCGCGCATTrgCArrATTAACCACACGCTTTACATCGTTTATGTCCCG 
CTGGCCACGAATTiy^CTACAAATTACTGAATTTACCATTAAGCATCCTGGTAAAAAGCAAGTCrAITCCGCCAGATCC 
TGCCCCGGAACTGCGGCTGGATACCTCTCGTCCAATTATGTACGTTTTACCCTACAACTCGAAAGCAGATTTGCTGAC 
GTTGCGCGCCCAGTGTCTGGCACATQACXTGCCTGACCC6TTA6AGCCGCTGQAAATCGACGCCACGCTACTGCCGCG 
CTATGTGTTCATTCACGGCGGGCCGCGTGTCrTCACCTATTACACCCCGAAAGAAGAGTCTATTAAGCrGTTCCACGA 
CTATCTCGATTTGCACCGTAGCAACCCAAATCTGGATGTGCAGATGGTGCCACTGTCGGTGATGTTTGGTCGCGC 
GGGGCGTGAAAAAGGCGAAGTGAACCCGCCGCTGCGTATGcrrAACGGCGTACAGAAATTTTTCGCTGTACTGTGGCT 
CGGTCCCGACAGTTTXGTGCGTTTCTCGCCCTCAGTTTCGCTGCGCCGTAXGGCGGAXGAA.CACGGCACCGATAAAAC 
TATCGCTCAGAAACTGGCGCGCGXGGCGCGTATCCACTTT6CCC6TCAACGTCXGGCTGCCGTAGGCCCACGTXTGCC 
XCCTCGTCAGGATCTGTTTAATAAGCTGCTCGCCTCCCGCGCCAXXGC CAAAGCGCTAGA AGATGAAGCGCGCAGCAA 
AAAAATCTCCCATGAAAAAGCGCAGCAG»^ACGCGATTGCACXGATGGAAGACAXTGCGGCGAATTXCXCITACGA» 
GAXTCGCCTGACXCACCGTATTCTGGGCTTCACCtGGAACO^CTTXACCAGGGCATCAACGTCCATAACGC^ 
Ql CGTTCGCaiGCXGGCCCACGACGGT»XGAGCXG6TATAlCXGCCXXGCCACCGCACTCrtCATCGAC:TACCXG 

XXCXTACGXGCTGTATCACCAGGGGCXGGXCCCGCCGCATATCGCCGCCGGGATCAACCrGAATTXCTGGCCTGCCGG 
GCCGATXTTCC6CCGTCXGGGGGCGTXCTTXATX CGCCGXACGXXXAAAGGCAATAAACTXXATT CCACCGTTTTCCG 
G6AGTATCTCGGCGAACTGXXCAGCCGTGGTTAXTCCCXCGACXACXTCGTGGAAGCCGGXCCXTCCCGTACGGGGCC 
01 TXTGCTGGATCCGAAAACXGGXACCCXGXCGAXGACCATTCAGGCGATGCXGCGTGGCGGCACGCGTCCGATXACGCT 

p :] gatxccgatctaxatcggxxatgagcacgxcatggaagtgggxacttaggccaaagaacxgcg cggcgcgacgaaaga 

jt^ GAAAGAGAGCCXGCCGCAGAXGCXGCGCGGTTXJ^GCAAGCTGCGXAATCXCGGTCAGCGrXACGXCAACXTCGGXG^ 
if ACCAAXGCCGXXGAXGACCXACCXTAACC^^CAXCXACCTGACTGGCCXGAAXCTAXCGAXCCCATCGAAGCGGTGCG 
^ TCCGGCATGGTTAACGCCGAC GCXCAATAATATTGCTCCCGAXCXGAXGGXACGC ATXf-ACAACGCAGGCGCGGCAAA 

yi; CGCCATGAACCTGXGCXGXACTGCGCTACTGSCATCACGTCACCGCXCACTCACCCGCX4AGCAGTTAACCGAGCAACT 
H CAACXGCTACCTGGATCTGAXGCGCAACGTGCCCXACTCCACGGACTCXACCGTTCCXI'CAGCCAGCGCCAGCGAGCT 
TATCGAXCACGCGCXGCAAAXGAACAAGXTTGAAGTCGACAAAGACACAAXCGGCGAC.\rCAXCAITCXGCCGCGCGA 
GCAAGCGGXGCTGATGACCTACXAXCGCAAaVACAXXGCGa^TAXGXXCGXCCTGCCrrCGCTGAXGGCGGCAAICGX 
i CACCCAGCAICGCCACATCXCCCCCGACGTATTGATCGAGCyWrGTCAATGTGCXTTACCCAAXGCXGAAAGCGGRGCT 
^1.- GTXCCTGCGCTGGGATCGCGACGAGXTGCCGGACGXXATIcaVXCCCCXCGCAAAXGACATCCAACGTCAGGGGCTGAT 
g?, TACCCTGCAAGATGATGAGXTGCATAXCAACCCGCCGCAXTCrCGCACGCTACAGCXGCXGGCCGCAGGCGCGCGCGA 
AACGCTGCAAtCTTATGCCATCACCXXCTGGTTGTTGAGTGCCAACCCCTCCAXCAACCGCGGIACGCTGGAGA^ 
GAGCCGC\CCGXCGCGCAACGTCXCXCCGXGCXGCACGGCAXCAACGC6CCGCAGTXCTTCGACAAGGC6GIGXXCA^ 
TTCTCTGGXGCXGACACXGCCTGAXGAAGGGXATAXCAGCGATACCGGCGATGCCGA'iCCGGCAGAAACCAXGAAGGT 
TXAXCASXXCCTGGCGGAGTTGAXXACAXCAGACCXGCGTTXGACGAXXGAGAGXGCf SAAG CAGft^ 
AGATCT 



£3 



Xran^lated protein sequence of the 9ene pXsB-h&rrs^ Amine acid Lys (K) « 
undexlinedr -replaced a Gly (G) and Thr (T) at position 3 and S from the C 
text&lnus. 

HTFCYPCBArAI.LTRGrTSTOSGMPRlYm»JJ?LPLSILVKSK3IPADPAPEax; 

^CLAHDL»0PI.BPX.ElDGXI,LPRWPIHGGFRVrrrXTPKEESIJarHDXII>UJR5 

KGEWPPLT«<U*GVQKFrAVI/Wt,GRD5rVRrSPSVSLRRMADEHCXD5Ttt 

DI.nnaiASSAXAlCAVBI)SAKSKKXSH£KACQ^AIAI>f££ lAANFS RLXBS Z£,GFTW>niLVQG7HVHKASK^/BQ 
rjUfDGHELW^CHRSH>©rLIXSWl,TraQGt.VPPHlAAGINI2»™PAGPIF»RlGA5' 
GEI^SRGrSVSyFVEGGR5RXGiaLDPKTGXI»SKrrOAKLRGGTMXTlIPlYIG>EHV>4E>^ 
LPCMLRGLSlCC,RlCLGQGyVOTGEP*lPIOTyLMQH\rPI)Wl^SIDPIEAVRPAWI,TPTV^^ 

LCCXAI.IA5J^QR5LTREQLTEQl2JCYXniMtNVPySTDSXVPSASASBLlDKAL0HW^ IGDI I ILPREOAV 

I«y!CIl^wIAH^fDVI.psI^IAArv^rguw^ISlUJ^/r^^ 

DDEIJ^NPAK5RTI^IXAAGAJ^TLQRyAIXF^aI»SA^^PsrN»GXLCK£SRXVAQ•^SVLHGI^^APEF^t)^ 
LTI^RDEGYISDSGDAE PAETMICVYO tlAELITSOVRLTl ESAKQKEG 
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